A novel energy distribution comparison approach for robust speech spectrum vector quantization

Authors

  • Ahmed Ismail
  • Yasser Dakroury
  • Hazem Abbas
Abstract

Vector quantization (VQ) has been used extensively in speech vocoders, but training normally requires a very large training set. This paper introduces a novel energy distribution comparison distortion measure for the high-band speech spectrum that enables the vector quantizer to operate with a relatively small training set. The measure has been used to construct a segmental vocoder that uses pitch periods as segments. The proposed approach, the Energy-Mass distortion measure, is described and compared with the use of MFCC as a distortion measure, showing that it represents the speech formants better under the small-training-set constraint. Finally, the performance of the Energy-Mass measure is evaluated using Spectral Distortion (SD). Speech quality perceived by the receiver is evaluated using the recently standardized objective quality measure PESQ, where an improvement of 0.3 in PESQ score was obtained.
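The abstract names Spectral Distortion (SD) as the evaluation metric without defining it. A minimal sketch of the commonly used log-spectral form follows, assuming SD is the root-mean-square difference, in dB, between two power spectra sampled on the same frequency grid. The function name, the `eps` floor, and the absence of any band weighting are illustrative assumptions, not details taken from the paper.

```python
import math


def spectral_distortion_db(ref_power, test_power, eps=1e-12):
    """RMS log-spectral difference in dB between two power spectra.

    Assumption: both inputs are sequences of non-negative power values
    sampled at the same frequencies. `eps` is a hypothetical floor that
    avoids taking the logarithm of zero.
    """
    if len(ref_power) != len(test_power) or len(ref_power) == 0:
        raise ValueError("spectra must be non-empty and of equal length")

    squared_sum = 0.0
    for p, q in zip(ref_power, test_power):
        # Difference of the two spectra expressed in decibels.
        diff_db = 10.0 * math.log10(max(p, eps)) - 10.0 * math.log10(max(q, eps))
        squared_sum += diff_db * diff_db

    # Root of the mean squared dB difference over all frequency bins.
    return math.sqrt(squared_sum / len(ref_power))
```

Under this definition, identical spectra give 0 dB, and a spectrum that is ten times stronger in every bin gives 10 dB.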

Similar resources

Robust speech recognition using missing feature theory and vector quantization

This paper addresses the problem of speech recognition in noisy conditions when low complexity is required like in embedded systems. In such systems, vector quantization is generally used to reduce the complexity of the recognition systems (e.g. HMMs). A novel approach for vector quantization based on the missing data theory is proposed. This approach allows to increase the robustness of the sy...


Robust speech mode based LSF vector quantization for low bit rate coders

Robust vector quantization of LSF parameters at a low bit rate is essential for voice coders operating below 5 kbps. A novel aspect of the proposed technique is the use of decorrelated residual LSF vectors from speech mode based backward prediction along with a multi-stage VQ design. Rates as low as 12 bits per 20 ms speech frame for the stationary voiced speech mode and 22 bits/frame for unvoi...


Using FFI Interpolator and VQ Quantization for Designing of High Quality 1200 BPS Speech Vocoder

Storing or transmitting speech signals at very low bit rates is an active area in the field of speech processing. We used stochastic inter-frame interpolators and vector quantization (VQ) as a new method for developing a high-quality 1200 BPS speech vocoder. The objective and subjective test results show that the performance of the new vocoder is comparable with that of standard 4800 BPS vocoders (such as CELP).


Improving the performance of MFCC for Persian robust speech recognition

Mel-frequency cepstral coefficients (MFCC) are the most widely used features in speech recognition, but they are very sensitive to noise. In this paper, to achieve satisfactory performance in Automatic Speech Recognition (ASR) applications, we introduce a new noise-robust set of MFCC vectors estimated through the following steps. First, spectral mean normalization is a pre-processing which applies to t...


Publication date: 2007